Privacy of Synthetic Data: A Statistical Framework
نویسندگان
چکیده
Privacy-preserving data analysis is emerging as a challenging problem with far-reaching impact. In particular, synthetic are promising concept toward solving the aporetic conflict between privacy and sharing. Yet, it known that accurately generating private, of certain kinds NP-hard. We develop statistical framework for differentially private data, which enables us to circumvent computational hardness problem. consider true random sample drawn from population $\Omega $ according some unknown density. then replace by much smaller subset ^{\ast}$ , we generate on reduced space fitting specified linear statistics obtained data. To ensure use common Laplacian mechanism. Employing Rényi condition number, measures how well sampling distribution correlated distribution, derive explicit bounds accuracy provided proposed method.
منابع مشابه
A Statistical Framework for Differential Privacy
One goal of statistical privacy research is to construct a data release mechanism that protects individual privacy while preserving information content. An example is a random mechanism that takes an input database X and outputs a random database Z according to a distribution Qn(·|X). Differential privacy is a particular privacy requirement developed by computer scientists in which Qn(·|X) is r...
متن کاملA centralized privacy-preserving framework for online social networks
There are some critical privacy concerns in the current online social networks (OSNs). Users' information is disclosed to different entities that they were not supposed to access. Furthermore, the notion of friendship is inadequate in OSNs since the degree of social relationships between users dynamically changes over the time. Additionally, users may define similar privacy settings for their f...
متن کاملA statistical analysis framework for bus reliability evaluation based on AVL data: A case study of Qazvin, Iran
Reliability is a fundamental factor in the operation of bus transportation systems for the reason that it signifies a straight indicator of the quality of service and operator’s costs. Todays, the application of GPS technology in bus systems provides big data availability, though it brings the difficulties of data preprocessing in a methodical approach. In this study, the principal component an...
متن کاملSIED, a Data Privacy Engineering Framework
While a number of data privacy techniques have been proposed in the recent years, a few frameworks have been suggested for the implementation of the data privacy process. Most of the proposed approaches are tailored towards implementing a specific data privacy algorithm but not the overall data privacy engineering and design process. Therefore, as a contribution, this study proposes SIED (Speci...
متن کاملA Privacy Engineering Framework for Big Data
In the Big Data era, the collection and mining of high-dimensional user data has become a fast growing and common practice by a large number of private and public institutions. In some cases, these institutions share data with third parties without an anonymity preferring user’s consent or awareness, thereby risking his privacy. In other cases, data is released voluntarily by a user not necessa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Information Theory
سال: 2023
ISSN: ['0018-9448', '1557-9654']
DOI: https://doi.org/10.1109/tit.2022.3216793